A Comparison of Reliability Coefficients for Ordinal Rating Scales

نویسندگان

چکیده

Abstract Kappa coefficients are commonly used for quantifying reliability on a categorical scale, whereas correlation applied to assess an interval scale. Both types of can be the ordinal rating scales. In this study, we compare seven scales: kappa included Cohen’s kappa, linearly weighted and quadratically kappa; intraclass ICC(3,1), Pearson’s correlation, Spearman’s rho, Kendall’s tau-b. The primary goal is provide thorough understanding these such that researcher make sensible choice A second aim find out whether coefficient matters. We studied what extent reach same conclusions about inter-rater with different coefficients, measure agreement in similar way, using analytic methods, simulated empirical data. Using analytical it shown differences between quadratic Pearson correlations increase if becomes larger. Differences three generally small rater means variances small. Furthermore, data, all tend raters increases. Moreover, data conclusion was reached virtually cases four coefficients. addition, as any great number times. Hence, does not really matter which five used. way: their values very highly correlated study.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comparison between Two Rating Scales for Perceived Exertion

Thirtytwo subjects,16 men and 16 women, participated in an experiment to compare two commonly used rating scales for perception of exertion, viz. the Borg RPE scale and the Borg CR10 scale (see, e.g., Borg, 1998). One group of 8 men and 8 women used each scale. Workloads were increased every minute, with 15 W for men, and with 10 W for women, to a voluntary maximum. With a basic perceptual nois...

متن کامل

Comment on "Inter-rater reliability of delirium rating scales".

lation and found a of 0.91, which is one of the many languages that this instrument is now translated into allowing international use in monitoring. Secondly, we have conducted a large-scale implementation study of the CAM-ICU. This year-long quality assurance/quality improvement project included 55 nurses, 711 patients, and two different medical centers [5] . Data were recorded prospectively a...

متن کامل

A Comparison between School and Home Rating Scales and Reliability-Validity of the Scales-the Scales for Diagnosing Attention-Deficit/Hyperactivity Disorder.

INTRODUCTION The purpose of the present research is to compare the Turkish translations of school and home versions of the Scales for Diagnosing Attention-Deficit/Hyperactivity Disorder (SCALES) developed by Ryser and McConnell with respect to age and gender and to examine the correlation between the two scales. METHOD The research was conducted with 102 teachers and parents of 891 children a...

متن کامل

Rating scales for neurologists.

Correspondence to: Dr Jeremy Hobart, Department of Clinical Neurosciences, Peninsula Medical School, Derriford Hospital, Plymouth PL6 8DH, UK; Jeremy.Hobart@ phnt.swest.nhs.uk _________________________ A neurologist once told me that he found the subject of rating scales ‘‘exceedingly dull’’, while another found the area ‘‘abstruse’’. I have therefore attempted to produce an overview that is he...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Classification

سال: 2021

ISSN: ['0176-4268', '1432-1343']

DOI: https://doi.org/10.1007/s00357-021-09386-5